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(54) TiUe: ANIMATION OF THREE-DIMENSIONAL CHARACTERS ALONG A PATH 
(57) Abstract 

A character is repre- 
sented in a character generator 
as a set of polygons. The 
character may be manipulated 
using three-dimensional 
animation techniques. A 
code for a character may be 
used to access a set of curves 
defining the outline of the 
character. This set of curves 
is transformed into a set of 
polygons. The set of polygons 

may be rendered as a three-dimensional object. The set of polygons may be created by converting the curves into^ sets of connected line 
segments and then tesscUating the polygon defined by the line segments. Animation properties are represented using a normalized scale 
along a path or over time. Animation may be provided in a manner that is independent of the spatial and temporal resolution of the video 
to which it is applied. Such animation may be applied to characters defined by a set of polygons. Various three-dimensional spatial 
transformations, lighting effects and other colorizations may be provided. A user interface for editing a character string may provide two 
alternate displays. A first display allows a user to input and view any desired portion of the character string for the purpose of editing. A 
second display allows a user to view how the character string appears at a selected point in dme during a titling effect for the purpose of 
animation. In both displays, the text is displayed in a three-dimensional form. This interface may be combined with a timeline editing 
interface for editing an associated video program, or other user interface, to permit layering of titling effects and adjustment of animation 
properties, positioning and timing. 
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ANIMATION OF THREE-DIMENSIONAL CHARACTERS ALONG A PATH 

5 Character generators are systems that are used to add text, such as titles and credits, to 

video programs, such as a television, film, video and other multimedia programs. Titles often 
are designed to roll or crawl over a screen, commonly called scrolling . Other effects 
commonly are performed. Some character generators focus on providing real-time 
multicharmel mixing of titling and video. Some provide more advanced creative features 

10 which are not producible in real-time. 

Most computer systems which render alphanumeric character strings, such as word 
processors or character generators, generally represent the character string as a set of 
characters, for example by using a tree or an array. Each character is defined by a code that 
refers to a bit-mapped image or raster image of the character. Alternatively, the code may 

1 5 refer to a set of curves defining the outline of the character which is converted to a bit-map or 
raster image. The data structure representing the character string is processed to layout each 
character in a plane according to the font, the character metric, the character size and the 
spacing between characters. The characters are then drawn in the image by placing the image 
of the character at a designated position in the plane. The plane is then displayed, possibly 

20 with some spatial transformation. Because character generators generally use bit-mapped 
images or raster images of text to add characters to video, spatial transformations performed 
on the characters may result in visual artifacts that are undesirable, such as pixellation. 

Several systems are available through which three-dimensional models of characters 
may be generated. One such system is shown in U.S. Patent No. 5,805,783 (EUson). 

25 Additional systems are described in "Designer Letters," Computer Graphics , November 
1991 . Most three-dimensional character generation systems require either complex three- 
dimensional model generation software, in which animation is provided through the same 
mechanisms that other three-dimensional models are animated. In such systems it is difficult 
to provide the real time editing and viewing of these three-dimensional effects. 

30 

SUMMARY 

A character is represented in a character generator as a set of polygons and may be 
manipulated using three-dimensional animation techniques. A code for a character may be 
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used to access a set of curves defining the outline of the character. This set of curves is 
transformed into a set of polygons. The set of polygons may be rendered as a three- 
dimensional object. The set of polygons may be created by converting the curves into sets of 
connected line segments and then tessellating the polygon defined by the line segments. 

5 Properties of a character may be animated over time or along a path. Animation over 

time may use a normalized scale over time to allow animation to be provided in a manner that 
is independent of the spatial and temporal resolution of the video to which it is applied. 
Properties of a character also may be a function of position of the character on a path. The 
position of the character on the path may vary over time or may be static. Two paths or rails 

10 also may be used to define the spatial properties of characters. Such animation may be 
applied to characters defined by a set of polygons. Various three-dimensional spatial 
transformations, Ughting effects and other colorizations may be provided. These effects may 
be defined by key fi-ames in an editing system used to create video program. 

A user interface for editing a character string may provide two ahemate displays. A 

15 first display allows a user to input and view any desired portion of the character string for the 
purpose of editing only the characters, but displaying them in three dimensions. A second 
display allows a user to view and edit how the character string appears at a selected point in 
time or position along a path during a titling effect for the purpose of animation. In both of 
these displays, the text is displayed in a three-dimensional form. This interface may be 

20 combined with a timeline editing interface for editing an associated video program, or other 
user interface, to permit layering of titiing effects and adjustment of animation properties and 
timing. 

Accordingly, in one aspect, a computer-implemented method and cornputer system 
generates a representation of an alphanumeric character as a set of polygons defining a 

25 surface of the alphanumeric character. A computer program product containing computer 
program instructions for performing the method also may be provided. An input providing 
an indication of a character code identifying the alphanumeric character is received. Curves 
defining an outline of the alphanumeric character are identified according tc the character 
code, A set of polygons defining a surface of the alphanumeric character are generated from 

30 the identified curves for the character code. The set of polygons may be generated by 
converting the curves into a set of connected line segments and tessellating the set of 
connected line segments to obtain the set of polygons. A user may define a path on which 
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one or more alphanumeric characters is placed. The properties of the characters may depend 
on their positions along the path. 

In another aspect, a method and computer system produces three-dimensional 
animation of alphanumeric characters. An input defining an outline of the alphanumeric 
character is received. Properties are associated with the character. A period of time during 
which the character is displayed or a path on which the character is placed may be defined. 
The properties associated with the character may vary according to the point in time in the 
defined period of time or the position of the character on the defined path. A set of polygons 
defming a surface of the character is generated using the received outline and properties. The 
set of polygons is rendered on a display according to the properties as specified over the 
period of time or along the path. 

In another aspect, a graphical user interface for displaying three-dimensional 
animated alphanumeric characters receives an indication of an alphanumeric character. In 
response to a receipt of an indication of an alphanumeric character, a set of polygons 
representing the alphanumeric character is generated according properties associated with the 
alphanumeric character. The set of polygons is rendered on a display. The properties may be 
determined according to a position of the character along a path, or as a ftmction of time. 

Another aspect is a video program produced by the process of generating a three- 
dimensional model of animated characters along a path or over time. 

BRIEF DESCRIPTION OF THE DRAWINGS 
In the drawings, 

Fig. 1 is a data flow diagram of a system for three-dimensional character generation in 
combination with a video editing system in one embodiment; 

Fig. 2 is a more detailed data flow diagram of a system for generating and rendering 
three-dimensional models of alpha numeric characters; 

Fig. 3 is a graphical illustration of a relationship of data structures defining a titling 
effect to be applied to video; 

Fig. 4 is a data flow diagram of a glyph manager shown in Fig. 3; 

Fig. 5 is a flow chart describing how the data structure shown in Fig. 3 may be 
processed to determine properties associated with each node in the structure; 

Fig. 6 is a flow chart describing how a character may be rendered; 
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Fig. 7 is a flow chart describing how a level of detail may be selected for defining a 
set of polygons to represent a character; 

Fig. 8 is a data flow diagram illustrating how a property value may be selected for a 
given frame of an effect according to a point in time in the effect; 
5 Fig. 9 is a data flow chart illustrating how a property value may be selected for a 

given character, according to the position of a character along a path; 

Fig. 10 is a flowchart describing how characters may be laid out along a path; 

Fig. 1 1 is an illustration of a graphical user interface for a timeline; 

Fig. 12 is an illustration of a graphical user interface for editing and viewing three- 

10 dimensional characters; 

Fig. 13 is an illustration of a graphical user interface for viev^ng three-dimensional 

characters in roll animation; 

Fig. 14 is an illustration of position dependent animation of scaling of text; 
Fig. 15 is an illustration of position dependent animation of y-axis rotation of text; 
15 Fig. 16 is an illustration of text on two parallel paths with a horizontal orientation; 

Fig. 17 is an illustmtion of text on non-parallel paths v^th a vertical orientation; and 
Fig. 1 8 is an illustration of text on non-parallel paths v^th a horizontal orientation. 

DETAILED DESCRIPTION 

20 The following detailed description should be read in conjunction with the attached 

drawing in which similar reference numbers indicate similar structures. All references cited 
herein are hereby expressly incorporated by reference. 

Referring now to Fig. 1, a character generator may be used in conjunction with, or 
independently from, a video editing system. A character generator receives alphanumeric 

25 character input from which image data is generated to be applied to the video data. An 

alphanumeric character is a graphical symbol defining a letter, number, punctuation or other 
symbol in a written language. In an embodiment shown in Fig. 1, the character generator is 
provided in conjunction with a video editor, which enables titling effects to be created by an 
editor along with a video program. The video program ultimately may be delivered as film, 

30 videotape, digital or analog, high definition or standard definition television signal or may be 
reproduced on a computer screen and/or stored on a computer readable medivim. In this 
embodiment, the character generator and video editing system have a graphical user interface 
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20 which receives user input 22 in order to edit a video program and to apply titling effects to 
the video stream. 

The graphical user interface generates display data 24 for placement, for example, on 
a computer display (not shown). The display data may include video data based on images of 

5 a video program in which titling effects are applied. Various user interfaces, such as a 

timeline or menus, in combination with a cursor control device or other input device, enable a 
user to define the input 22 applied to the graphical user interface 20. The user may input 
information such as an alphanumeric character string 26 to be used in a titling effect applied 
to the video data. Manipulations to the timeline, as shown at 28, are provided to a video 

1 0 editing system 30 which maintains a representation of the video program being edited, 
commonly called a composition. The video editing system outputs data representing the 
timeline as indicated at 32 which is processed by the graphical user interface to be added to 
the display data 24. The alphanumeric character string is input to a three-dimensional 
layout and rendering module 34. A character may be associated with properties 36 defining 

15 characteristics of the character such as a font, rotation, position, size, kerning and lighting. 
The three-dimensional layout and rendering module 34 uses the properties 36 and the 
alphanumeric character string 26 to generate a set of polygons defining the characters. The 
sets of polygons are rendered to produce three-dimensional character data 38 which is 
included in the display data 24 through the graphical user interface 20. 

20 Additional details of the three-dimensional layout and rendering module 34 are shown 

in Fig. 2. Fig. 2 illustrates a three-dimensional model generator 40 which receives an 
indication of an alphanumeric character 42 and the properties 36. There may be one or more 
alphanumeric characters 42 in the alphanumerical character string 26 in Fig. 1 , which are 
rendered separately by the module shovm in Fig. 2. Three-dimensional model generator 40 

25 outputs a set of polygons 44 that defines the alphanumeric character in three-dimensions. 
This character is rendered using a three-dimensional rendering module 46 to produce the 
display data for the character 48. The display data for several characters is combined in a 
layout determined using standard techniques to produce the three-dimensional character data 
38 in Fig. 1. 

30 By representing a character as a set of polygons which is rendered in three- 

dimensions, rather than a raster image, several transformations may be performed on the 
character in real-time to provide a displayed output to the editor illustrating how the character 
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appears in three-dimensions. Because the character is represented as a polygon, various 
pixellation or other visual artifacts are not created by the spatial transformations. The 
characters also may be animated over time or along a path and the animation may be defined 
in a resolution independent manner. Animation signifies varying any property, such as 
5 defined below, of three-dimensional object over time or along a path. 

One embodiment of the system of Fig. 1 will now be described in connection with 
Figs. 3-10. The system described herein creates a three-dimensional model w^hich is rendered 
to produce the display data added to the image space. There are many ways to layout and to 
represent alphanumeric character strings, and the invention is not limited to those described 
10 herein. The following description provides an example implementation for representing a 
character as a set of polygons generated from a character code, font and other properties. 

A computer system for implementing the system of Figs. 1 and 2 as a computer 
program may include a main unit connected to both an output device which displays 
information to a user and an input device which receives input fi-om a user. The main unit 
15 may include a processor connected to a memory system via an interconnection mechanism. 
The input device and output device also are connected to the processor and memory system 
via the interconnection mechanism. 

It should be understood that one or more output devices may be connected to the 
computer system. Example output devices include a cathode ray tube (CRT) display, liquid 
20 crystal displays (LCD) and other video output devices, printers, communication devices such 
as a modem, storage devices such as disk or tape, and audio output. It should also be 
understood that one or more input devices may be connected to the computer system. 
Example input devices include a keyboard, keypad, track ball, mouse, pen and tablet, 
communication device, and data input devices such as audio and video capture devices. It 
25 should be understood that the invention is not limited to the particular input or output devices 
used in combination with the computer system or to those described herein. 

The computer system may be a general purpose computer system which is 
programmable using a computer programming language, such as AC++,@ JAVA or other 
language, such as a scripting language or even assembly language. An example computer 
30 system is the Infinite Reality computer system from Silicon Graphics, Inc. The computer 
system may also be specially programmed, special purpose hardware, or an application 
specific integrated circuit (ASIC). In a general purpose computer system, the processor is 
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typically a commercially available processor, of which the series x86 and Pentium series 
processors, available from Intel, and similar devices from AMD and Cyrix, the 680X0 series 
microprocessors available from Motorola, the PowerPC microprocessor from IBM and the 
Alpha-series processors from Digital Equipment Corporation, and the MIPS microprocessor 

5 from MIPS Technologies are examples. Many other processors are available. Such a 
microprocessor executes a program called an operating system, of which WindowsNT, 
Windows 95 or 98, IRIX, UNIX, Linux, DOS, VMS, MacOS and OSS are examples, which 
controls the execution of other computer programs and provides scheduling, debugging, 
input/output control, accounting, compilation, storage assignment, data management and 

10 memory management, and communication control and related services. The processor and 
operating system defines computer platform for which application programs in high-level 
progranuning languages are written. 

A memory system typically includes a computer readable and writeable nonvolatile 
recording medium, of which a magnetic disk, a flash memory and tape are examples. The 

1 5 disk may be removable, known as a floppy disk, or permanent, known as a hard drive. A disk 
has a number of tracks in which signals are stored, typically in binary form, i.e., a fomi 
interpreted as a sequence of one and zeros. Such signals may define an application program 
to be executed by the microprocessor, or information stored on the disk to be processed by 
the application program. Typically, in operation, the processor causes data to be read from 

20 the nonvolatile recording medium into an integrated circuit memory element, which is 
typically a volatile, random access memory such as a dynamic random access memory 
(DRAM) or static memory (SRAM). The integrated circuit memory element allows for faster 
access to the information by the processor than does the disk. The processor generally 
manipulates the data within the integrated cfrcuit memory and then copies the data to the disk 

25 after processing is completed. A variety of mechanisms are known for managing data 

movement between the disk and the integrated circuit memory element, and the invention is 
not limited thereto. It should also be understood that the invention is not limited to a 
particular memory system. 

Such a system may be implemented in software or hardware or firmware, or a 

30 combination of the three. The various elements of the system, either individually or in 
combination may be implemented as a computer program product tangibly embodied in a 
machine-readable storage device for execution by a computer processor. Various steps of the 
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process may be performed by a computer processor executing a program tangibly embodied 
on a computer-readable medium to perform functions by operating on input and generating 
output. Computer programming languages suitable for implementing such a system include 
procedural programming languages, object-oriented progranmiing languages, and 

5 combinations of the two. 

It should be understood that invention is not limited to a particular computer platform, 
particular processor, or particular high-level programming language. Additionally, the 
computer system may be a multiprocessor computer system or may include multiple 
computers connected over a computer network. It should be imderstood that each module or 

10 step shown in the accompanying figures may correspond to separate modules of a computer 
program, or may be separate computer programs. Such modules may be operable on separate 
computers. 

In one embodiment, an OpenGL software library (OpenGL is a registered trademark 
of Silicon Graphics, Inc.) which is accessible through an application program interface (API), 

15 is used to implement commands that specify objects and operations to produce interactive, 
three-dimensional applications. In a computer system which supports OpenGL, the operating 
system and application programs can make calls to the computer graphics system according 
to the standardized API without knowledge of the underlying hardware. The OpenGL 
standard provides a library of graphics manipulation commands for describing models of 

20 three-dimensional objects. The OpenGL standard is described in the OpenGL 

Programming Guide , Version 1.1 (1997), the OpenGL Reference Manual , Version 1 . 1 
(1997), and The OpenGL Graphics System: A Specification (Version 1.0), by Segal and 
Akeley, all of which are hereby incorporated by reference in their entirety. 

Referring now to Fig. 3, in one embodiment a data structure which represents a titling 

25 effect to be applied to video is a scene graph. The implementation may be object oriented. 
The scene graph object may be implemented as a tree or other structure such as a list, array, 
etc. An example of such a tree is shown at 50 in Fig. 3. The tree has a root node 52. The 
root node has a collection of children nodes which may be, for example, a text box 54 or 
shape 56 or page deck 55. The root node also has associated with it a collection of property 

30 stacks which will be described in more detail below. 
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A shape object is any arbitrary closed two-dimensional sh^e, such as a rectangle, 
circle or triangle. Accordii^ly, a shape node 56 is represented by data defining the shape, 
such as a B^zier. 

A text box object 54 is defined as a collection of characters, which may be 
5 representii^ using any data structure, such has a tree. A text box may be implemented to 
include other text boxes. A text box also has an associated property list, as described below. 
A text box, for example, may have width and height properties. 

Another kind of object is a page deck object 55. A page deck is a collection one or 
more pages 57, 59. A page may include other kinds of objects such as text boxes, shapes or 
10 other page decks. A page deck has a display method that displays its children one at a time. 
A child is selected according to a function of time. 

Each character object , such as shown at 58, 60, 61 and 63, is represented by a 
character code and a property list. The character code may be an ASCH code defining the 
alphanumeric character or symbol. 
15 Glyph objects, illustrated at 62, 64, 66 and 68, are dynamically created from character 

and shape objects according to a code for a character or shape object and associated 
properties. A glyph defines commands which may be used to render a character on a display. 
Glyphs for different characters, but having the same font name, profile, and rendering mode 
may be combined into glyph sets. 
20 In one embodiment, a glyph is implemented as a display list in the OpenGL s ta n dar d, 

which is created by rendering the set of polygons defining a character or shape, such as 
described below in connection with Fig. 5. By using a display list, three-dimensional 
rendering of a set of polygons may be accelerated by caching the display list for the character 
as a glyph Such caching is maintained by a glyph manager 70. 
25 The glyph manager 70 is a repository, such as a database or a data structure, that 

stores glyphs created for a combination of character code, font, profile, and rendering mode 
and a level of detail. Fonts with different style attributes that alter the outline of the 
character, such as bold and itahc, are treated as separate fonts. The profile corresponds to 
beveling or other border properties that also alter the outline of the character. The rendering 
30 mode includes wire frame or non-wire frame rendering. The level of detail (LOD) represents 
a particular polygonal tessellation of the glyph. Glyphs having the same font, profile, 
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rendering mode and LOD are combined into a glyph set. A glyph set may include a glyph for 
characters in a specified font with a specified profile, rendering mode and level of detail. 

Referring now to Fig. 4, the glyph manager 70 stores glyphs 202 in glyph sets 70. An 
indication of a font, profile, rendering mode and LOD 204 identify a glyph set. In response 

5 to an indication of a font, profile, rendering mode and LOD, a glyph set 200 may be 

identified or maintained by the glyph manager. A reference or pointer to a glyph set may be 
returned by the glyph manager as shown at 206. To retrieve a glyph 210 from a glyph set 
200, a character code 212 and the reference 206 to the glyph set are used. In one 
embodiment, the glyph set may be implemented as an object having a draw method that is 

10 passed an indication of a character code. Given the character code, the glyph set draws the 
glyph for the indicated character code. In a computer system such as a described above, in 
particular the Infinite Reality computer system, an entire glyph set 200 may be loaded into a 
memory associated with the graphics processor in order to accelerate rendering. 

Properties are values which control the appearance and position of nodes in a scene 

15 graph. These values may be animated in a video presentation. In particular, the value of a 
property may be a fianction of time or may be a function of position of the object along a 
path. There are several kinds of properties that may be associated with an object, which may 
be grouped together in any number of ways. Material or surface properties are used to color 
an object. Shadow properties define the color and nature of an object's shadows. Border 

20 properties define the material and nature of an object's border, such as an outline. Lighting 
properties define how lighting is directed at objects. Effect properties derme special effect 
attributes for rendering the object. Font properties define the aspects of the font used for 
character objects. Transform properties define various spatial transforms applied to the 
object. Layout properties define how objects relate to each other. Some properties, as noted 

25 above, affect the glyph set in which a character belongs. Other properties, such as those that 
define affine transforms affect the projection of a rendered object onto a display. Using 
OpenGL, these properties may be used to alter the modelview matrix. 

Example material or surface properties include color, opacity, shininess, texture 
pattern or image, texture mapping, lighting, overlapping, and features affecting textures, such 

30 as tiling, tinting, mapping, scale, offset and rotation. Example shadow properties include the 
type of shadow, shadow material, angle of the shadow, distance from the object, softness, 
color and opacity of the shadow. Example border properties include the type, such as outline 
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or bevel, material, thickness and bevel smoothness. Example lighting properties include 
direction, offset, color, intensity and size. Example effect properties include delay indicating 
a time of wait until a next object is drawn, any transition used to place the object on a screen, 
such as a dissolve or wipe, the depth of extrusion, whether objects should be rendered as wire 

5 frames, or other information. Example font properties include the font family such as Times 
or Helvetica, or font name, the font size, and style attributes such as bold, italic and 
underline. Example transform properties include the position relative to any parent object in 
the scene graph, the center of rotation and scaling with respect to the position, an amount of 
scaling and amount of rotation, and any anchor point. Such transform properties may be 

10 specified in any coordinate space, such as Cartesian coordinates. Example layout properties 
include the amount of additional kern between objects, the amount of additional leading 
above objects, the justification or alignment, margins, and scroll position. 

For each type of property, a property stack is maintained. The list of stacks 72 is 
maintained as part of the root object 52. A stack is used to track the active properties for a 

1 5 node during traversal of the scene graph. Each node in the scene graph may have a property 
list such as shown at 74. A property list is a list of properties and associated values for each 
node which differ from default values in the property stacks associated with the root node. If 
a property is not in the property list of a given node, the property is inherited by that node 
from any ancestor node in the scene graph. If a node has a property in its property list, that 

20 node overrides the property value from the ancestor nodes. For example, property list 74 
includes values 76 for font height and 78 for opacity. However, it does not include a value 
for the property font type. Therefore, node 61 inherits its font type from ancestor nodes text 
box 54 or root 52, but overrides the font height and opacity values. Each character also may 
have its own properties that is may inherit or that it may define as a function of time or 

25 position along a path. 

A property value may be defined by a function based on a normalized representation 
of time that refers to the duration of the scene graph. In particular, the property values may 
be defined as a function of time, where time ranges in value from zero to one. This function 
may be represented by a Bezier curve. Different values may be associated with the property 

30 over time to permit animation of the property. In a graph of a property value, where a 
horizontal ^is represents time and a vertical axis represents a value, a constant value is 
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represented by a horizontal line. A graphical user interface that displays the Bezier curve 
may permit a user to manipulate the curve to change these values. 

A property value also may be defined by a function based on a normalized position of 
an object along a defined path. The position of objects, such as characters, along the path 

5 may be animated over time. In particular, the property values may be defined as a function of 
a position along the path, where the position ranges in value from zero to one. The path may 
be represented by a Bezier curve. Different values may be associated with a property along 
the path to permit animation of the property. In a graph of a property value, v^^here a 
horizontal axis represents a position along the path and a vertical axis represents a value, a 

10 constant value is represented by a horizontal line. A graphical user interface that displays the 
Bezier curve defining the property may permit a user to manipulate the curve to change these 
values. 

A path may be specified in a text box object as a path object. A Bezier curve may be 
used to define a path. Other structures also may be used to defined paths. In addition to the 

1 5 structure used to defined a path, a path object has at least two properties: a baseline offset 
and an orientation. Baseline offset, described in more detail below, specifies an offset of the 
characters of text firom the path. The orientation specifies the direction of the text characters 
on the path. For example they may be oriented vertically, or perpendicular to the path or 
parallel to the path. A path may be specified by a user using standard techniques for defining 

20 curves in three-dimensions. 

The position of the text on the path is based on whether the text is static on or scrolls 
along the path. If the text is static on the path, the text may be aligned along the path, for 
example by being left aligned, center aligned, right aligned, justified, or equally spaced. If 
the text is scrolled along the path, a scroll position property may be defined to vary over time 

25 specified for the text. A scroll position indicates an offset from a start of a path. For a 

rectangular path, the start of the path is the upper left comer. For elliptical paths, the start of 
the path is the top of the ellipse. For all other shapes, the start of the path is the first control 
point created to define the shape defining the path. 

As noted above, text also may be offset firom a path. For example, a baseline offset 

30 value may indicate the position from the path of the location of the text. A value of zero for 
the baseline offset indicates that the text is exactly along the path. Values greater than zero 
shift the text above the path, whereas values less than zero shift the text below the path. The 
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position of a character along a path generally is independent of the offset The ofi&et for a 
character may be independent of or a function of the position along the path. 

As mentioned above, when processing the scene graph either to position characters or 
to render characters, the properties associated with a node are pushed onto the property stacks 
when processing that node and its children After completion of the processing of a node and 
its children, the properties are popped from the property stacks. 

In order to process a scene graph and in order to display its contents, the scene graph 
is traversed using a simple "in order" or "depth first" traversal. The graph may be first 
traversed to perform a layout fiinction, i.e., positioning of characters. Where the text is 
placed along a path, the layout is described in more detail below in connection with Fig. 10. 
The graph may be next traversed to draw each character on the display or to an image buffer. 
A process for traversing the tree to identify the property values for each node and character in 
the tree will be described now in connection with Fig, 5. 

Fig. 5 illustrates pseudocode for implementing this procedure. The procedure begins 
with processing of the root node, which has associated property stacks. The root node has its 
own property list that defines system-wide default values. The property list also may be 
changed to provide values which differ from system-wide default values. For each property 
in the node's property Ust, the property is pushed onto the corresponding property stack in 
step 80. The node is then processed, to either layout or draw the contents of the node in step 
82. For each child of the current node, as determined in step 84, this procedure is recursively 
performed as identified at 86. After completing processing of all the child nodes, for each 
property in the property list of the current node, as determined in 88, the property values are 
popped fix»m the corresponding property stacks in step 90. This process of referencing 
properties by a property stack during tree traversal implements property inheritance and 
overriding. 

In orda: to display the contents of a text box, two functions are performed. Fust, the 
text is laid oxit in a plane or space defined by its parent node, by selecting position for each 
character based on the spatial properties of the character. The characters may be laid out 
along a path in this plane or space, and the spatial properties of a character (as well as other 
properties) may vary with its position along the patii. Processes for laying out characters of 
the same size and orientation in two-dimensions are well-known in the art A process for 
laying out characters along a path where the positions of the characters along the path affect 
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their properties (in particular, size, orientation and other spatial properties) is described in 
more detail below in connection with Fig. 10. The characters are then drawn in their selected 
positions. The plane or space into which the characters are drawn may be transformed for 
display. The layout and drawing operations may be performed for each character separately 

5 or for all characters in a string at one time. 

The drawing of three-dimensional text characters represented by polygons will now 
be described in connection with Fig. 6. Fig. 6 is a flow chart describing the process of 
rendering a character. This process begins by a user specifying a character code, font, profile 
and rendering mode in step 100. The font, profile and rendering mode may be selected from 

10 the property stack at the node for this character. 

As described above, the character node defines a character code representing the 
character, such as a Unicode value, a standard maintained by the Unicode consortium, an 
ASCII code, or a code defined in ISO-8859-1 (Latin), ISO-8859-7 (Greek), ISO 8859-5 
(Cyrillic), ISO-8859-8 (Hebrew), or ISO 8859-6 (Arabic), or any other character code 

15 standard. Typel fonts (from Adobe Systems, Inc.), TrueType fonts (from Microsoft 

Corporation) and Bitstream fonts (fi-om Bitstream, Inc.), or any other font defined by contour 
definitions, for example by using B6zier splines, may be used. 

A glyph may be available for the specified character, font, profile and rendering mode 
in the glyph manager 70. If it is determined in step 108 that a glyph is available in the glyph 

20 manager for the specified character, the glyph is retrieved in step 1 12 from the glyph 
manager. The glyph is then executed in step 1 14. 

If a glyph is not available for the specified character, font, profile and rendering 
mode, the contours or curves defining the character in the specified font are retrieved using 
the specified character code, in step 102. 

25 The curves defining a character are then subdivided in step 104 into a set of line 

segments. The subdivision of Bezier curves is described, for example, in Computer Graphics 
Principles and Practices , Second Edition, by James D. Foley, Andries van Dam, Stephen 
Finer and John Hughes, Addison- Wesley Publishing Company, 1990, pages 507-51 1, which 
is hereby incorporated by reference. The set of line segments define an outline of the 

30 character. 

Changes in the height and size of a character may affect the number of polygons 
created by tessellation (in the next step 106 in Fig. 6) in order to make the resulting image 



wo 00/63848 



-15- 



PCT/USOO/09944 



look smootL In particular, sixiall characters have a low level of detail and few polygons. 
Large characters have a high level of detail and many polygons. The level of detail may be 
increased by recursively subdividing the line segments according to a desired level of detaiL 
Such recursive subdivision often is \ised to improve accuracy of an approximation of a 
polygon to a surface. Such subdivision techniques are described in the OpenGL 
Programming Guide, Version 1.1, Second Edition, pp. 86-89, which is hereby incorporated 
by reference. A particular embodiment for determining a desired level of detail is described 
in more detail below in comiection with Fig. 7. The line segments obtained by subdividing 
the curves are continually subdivided if two adjacent line segments have an angle that 
exceeds a threshold determined by the desired level of detaiL 

The polygons defining the outline of a character are then tesselated in step 106. 
Tessellation is a fimction provided by the OpenGL apphcadon programming interface. This 
function may be used with a parameter "glu_tessjtolerance," vAddi is set to zero. 
Tessellation fills spaces defined by the line segments with polygons, typically triangles, and 
lines, to provide a set of polygons defining a sluq)e of a character. The set of polygons 
resulting from tessellation then may be rendered in step 110. If rendered using OpenGL, a 
display list may be created and stored in the glyph manager. The display list may be 
executed in step 1 14. 

Prior to tessellating the polygons defining a character, a two-dimensional outline of a 
character may be transformed into a solid three-dimensional object by applying a profile to 
the character outline, A profile is defined in a plane perpendicular to the plane of the two- 
dimensional outline. The compxiter system sweeps the profile along the character outline to 
produce a three-dimensional shape. These techniques of profiling and sweeping are standard 
three-dimensional modeling techniques. For example, a circular profile produces a three- 
dimensional tubular character after sweeping. 

Referring now to Fig. 7, the determination of the threshold used to control subdivision 
of line segments in step 104 of Fig, 6, is based on a value representing a level of detail. In 
particular, the tolerance value (T) is equal to two to the power of the level of detail (LOD) 
value (T=2L0D). In this embodiment, level zero is the maximxmi level of detail. Increasing 
integers correspond to lower levels of detail. The tolerance depends on the size of the 
resulting image in pixels, the height of the font, relative to the project height, any scaling 
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factor, and a user provided value of quality. This value may be provided as any value greater 
than zero, where one is a nominal quality, or defiault value. 

In the embodiment shown in Fig, 7, the detenxunation level of detail is based on the 
following assumptions; the font height is a value of zero to one and is relative to the project 
height; the project height is measured in pixels; and the scaling fector by which B6zier curves 
are input into the system is 4096, The scaling factor allows polygon vertex values to be two 
byte scaled integers. 

A first step of computing the level of detail is setting the level of detail value to zero 
in step 300. A size value is then computed as the product of the font height and flie scale 
factor in step 302. A value called "p" is the B6aer curve scaling factor, e.g., 4096, multiplied 
by one half, then divided by the project height, and then by the quality value in step 304, If 
the value p is greater than the size value, as determined in step 306, processing is completed 
as indicated at 308 and the current level of detail value is xised to compute the tolerance for 
the subdivision process. Otiierwise, the value p is multiplied by one half in step 310 and the 
level of detail value is incremented in step 312, Steps 306 through 312 repeat until the value 
p is greater than the size value. 

It should be understood that other methods may be used to determined the tolerance to 
which the subdivision step of 104 uses, and that this invention is not limited to the example 
embodiment set forth herein. 

The properties for a scene may be animated over time. In order to determine a 
particular value to be used in a given image of a sequence of temporally related images, the 
property selected for example by using a property value selector shown at 120 in Figure 8. 
An image may be a frame or a field The property value selector receives an effect time 
duration 122 which is defined by an editor using for example the video editing system to 
apply the effect to one or more video tracks. The image rate of the associated video is also 
input at 124. The property value is defined by a property specification 126 which may be a 
B&ier curve having values associated with a time scale ranging from zero to one. The 
property value selector 120 multiplies the effect time duration by the image rate to determine 
the number of images over which the effect is applied. The ratio of the current time 128 (in 
images with respect to a start image ) to the total number of im^es in the effect defines a 
time value between zero and one which is used to identify a property value 130 according to 
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the property specification 126. For any given image, property value 130 is used to define the 
displayed characters. 

The properties for a scene also may be animated according to the position of each 
character along a path. Path dependent property values may be specified in a similar manner 

5 as time dependent property values. The same interface may be used for defining the property 
values over both time and position. The properties to be varied are associated with the path 
to which they are applied. Accordingly, a Bezier curve may be used, for example, to 
represent the dependency of a property on a position along a path. An interpretation of these 
values as either property or time dependent may be specified using a rendering mode having 

10 two states: time or position. Some values may be defined as path dependent. Others may be 
defined as time dependent. Others may be defined so as not to vary. 

For static and aligned text along a path, the position of a particular character may be 
determined once for the duration of the effect. A path also may have scroll property that 
varies over time. The scroll property may be defined in the same manner as other time 

1 5 dependent properties. Given the calculation of the scroll position for a given image in a 

sequence of temporally related images, the characters may be laid out according to the scroll 
position and their position dependent properties. 

Referring now to Fig. 9, how various properties for a character may be animated 
according to the position of the characters on the path will now be described. To determine 

20 which value for a property to use for a character in an image of a sequence of temporally 

related images, the property is selected, for example by using a position-based property value 
selector 900 as shown in Fig. 9. The position based property value selector 900 receives a 
nomialized position 902 for each character laid out on the path. Each character may have its 
own properties defined in this manner to override inherited properties. The property value 

25 906 for the character is selected according to the normalized position 902 from a property 
specification 904 defined for character objects or the path object. The normalized position 
902 represents a value from zero to one along the path. The property specification generally 
has values associated with a position scale ranging from zero to one. This property 
specification may be the same property specification 126 (Fig. 8) that has values associated 

30 with a time scale ranging from zero to one, but used for a different purpose. 

The normalized position 902 is determined by a layout module 912 using a starting 
point 908 for the character string to be placed on the path and the character string 910. The 
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starting point 908 is determined by a starting position selector 914, that uses the effect time 
duration 916, current time 918 and frame rate 920 as described above in connection with Fig. 
8 with a time dependent scroll position specification 922. If the text is to crawl or otherwise 
scroll along the path, then the scroll varies according to time and is computed for each image 
5 in a manner similar to other properties described above in connection Fig. 8. Otherwise, the 
starting point may be static. If the scroll node is static, the current scroll position is set to 
zero. If the path is a closed path, the current scroll position is determined by multiplying the 
effect time duration by the image rate to determine the number of images over which the 
effect is supplied. The ratio of the current time (in images with respect to a start image) to 
10 the total number of images in the effect is used to fmd a time value between zero and one 
defining the point in time at which the current image occurs in the effect. This ratio is 
multiplied by the path length to obtain a current scroll position. If the curve is not closed, 
additional factors are applied to compute the scroll position to prevent edges of characters 
from being viewed if the scroll position is near zero or one. An example formula is expressed 
15 by the following equation: 

scroll position = -(text width * 1.01) + 1 * (text width + path length) * 1.005, 
Because one of the properties of characters that may vary over time is size or any 
other spatial property, which in turn affects position, property values 906 for a character may 
be fed back to the layout and rendering module 912 to adjust a character ' s position. How 
20 characters in a character string are laid out given a path and starting point will now be 
described in more detail in connection with the flow chart of Fig. 1 0. 

One difficulty in laying out text that is scrolling and has position dependent properties 
is that the width of text may be a function of its position, and its position similarly is a 
function of its width. To simplify this process, as shown in Fig. 10, the process begins by 
25 identifying the current scroll position in step 1000. This position is used to determine the size 
of the first character in step 1002. An initial position of the next character is determined in 
step 1 004 by adding the width of the current character to the current position plus any spacing 
specified by keming or justification. This position is used to determine the size of the next 
character in step 1006. The initial position of the next character is then adjusted in step 1008 
30 such that the distance specified by keming or justification between this character and the 
previous character is maintamed. Steps 1006 and 1008 may be repeated until the adjustment 
amount is below a specified threshold as indicated at 1010. The orientation of the character 
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on the path is determined by calculating the tangent to the path at the detenained position. If 
more characters remain to be laid out as determined in step 1012, the process continues by 
repeating steps 1004 through 1012. Otherwise the process is completed on processing the 
last visible character on the path. If the path is closed, all characters are displayed. If the 

5 path is open, and if the current position of a character is defined by reference to the path, 
whether a character is on the path may be readily determined from its detennined position. It 
should be understood that a standard linear layout of characters may be used where none of 
the position dependent properties affect the width or spacing between characters. 

Having now described the operation of the three-dimensional layout and rendering 

10 module 34 (Fig. 1), the graphical user interface 20 will now be described in more detail. It 
should be understood that the followdng is only an example interface and that many other user 
interfaces may be provided for editing text for use in rendering three-dimensional titling 
effects. A timeline 32 (Fig. 1) is shown in Fig. 1 1 at 140, The timeline may have one or 
more video or audio tracks 142 as is commonly provided in computer-based non-linear 

15 editing systems such as the Avid/1 Media Composer from Avid Technology, Inc. A titling 
track 144 also is provided to define titling effects or other character generation effects. A 
titling effect on the timeline, such as shown at 146, has a duration which may be adjusted by 
manipulation of the user interface using known techniques. The duration of the titling effect 
on the timeline interface may be maintained in a composition representing the program and 

20 may be used as described above in connection with Figs. 8 and 9. 

If an icon for a text box is selected in the timeline shovm in Fig. 1 1 , an editing box 
may be provided to edit the text in the text box as shown in Figs. 12 and 13. In particular, the 
graphical user interface for editing text in a text box may operate in two modes. 

In a first mode, shown in Fig. 12, the text display area 160 is similar to a word 

25 processor. A ruler tool 162 may be used to identify spacing of the text. A scroll bar 164 may 
be provided to view text that does not fit within the text display area 1 60. A cursor 1 66 may 
be provided in this editor. The characters actually displayed in text display area 160 may be 
rendered using the three-dimensional techniques described above to provide a "what-you-see- 
is-what-you-get" (WYSIWYG) interface, without positioning in the text display area 160 that 

30 is effected by rolling or crawling. This mode may be activated by, for example, selecting the 
text display area 160 using a pointing device, such as a mouse. 
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In response to a user selecting an area outside of the text display area with the 
pointing device 166 or providing a predetermined input, such as a specified key on the 
keyboard, an alternate mode for the purpose of animation is shown, as illustrated in Fig. 13. 
In this mode, the display area 168 displays the text, with three-dimensional rendering as it 
5 appears at a selected time in the effect with spatial effects, such as rolling or crawling 

applied. The selected time in the effect is defined by a key fi^e control 170. The key ftame 
control defines a range from zero to one over the effect A point in time in the effect, to have 
rendered in the display 168, may be selected using a key ftame indicator 172. Alternatively, 
the key frame control 170 may be horizontal or vertical m another position, or in another 
10 orientation, to allow key framing of a roll or crawl or other effect position. 

Because a character is represented by a set of polygons instead of a bit-map, a 
character maybe manipulated using- three-dimensional animation techniques. Ajoimation 
properties that use a normalized scale over time pennits animation that is independent of the 
spatial and temporal resolution of the video to which the animated characters are applied. 
15 The storage of sets of polygons created from contours defining characters in a font 

defines a new representation of the font and allows new fonts to be created. The sets of 
polygons for characters of various fonts may be stored both to improve performance and for 
later retrieval to avoid recalculation. 

Example position dependent animation of objects on a path will now be described in 
20 connection with Figs. 14 and 15. Fig* 14 illustrates how the size of text may be modified 
with position along the path. Fig. 14 shows a path 1400 on which the text *text effects" 1402 
is laid out. As the text moves from right to left, the size of each character changes. For a 
given image in the effect, the position of each character, such as the '*x," is dependent upon 
the curve 1404 specifying the object size. This scale function curve 1404 specifies 10% scale 
25 at position 1 (at 1406), a 100% scale at position 0.5 (at 1408) and 10% scale agam at position 
0 (at 1410) with linear ramps 1412 and 1414 between these three points. 

Fig. 15 illustrates a rotation in the Y axis. The path 1500 has associated with it the 
characters "text effect," at 1 502. As die text moves from left to right, the orientation of each 
character changes. A rotation from 0-180° is specified linearly by the property curve 1 506. 
30 For any given character, such as the **x," its rotation in the y-axis is determined by identifying 
the position along the path 1500, from which its corresponding rotation value can be 
determined 1506. 
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It also is possible to specify a second path associated with a first path that determines 
the height of each text object by the distance between the two paths. Such paths may have 
various configurations. Forexample, the paths may be parallel or non-parailel. In either 
orientation, each character may be placed along and between the paths (i.e., parallel to the 
5 path). Alternatively, the text may be laid out perpendicular to and between the paths. 

Parallel paths 1600 and 1602 are shown in Figs. 16A and 16B. In these embodiments, 
the text may be laid out linearly because the height 1604 of each character is not position 
dependent. If the paths are not straight lines, such as shown at 1606 and 1608 in Fig. 16B, 
the position of each character may be used to determine a distortion lattice for the character. 

10 Such a distortion lattice may be generated by subdividing and transforming a rectangle 

defined about the character's position along the paths. In particular, a tangent at the position 
of the character along the path is computed. A box having a base along this tangent to the 
path at the character's position is defined, A segment of the path having end points defined 
by a projection of the box on the path is then used to transform the box to provide the 

15 distortion lattice. Such a distortion lattice, 1610 in Fig. 16C, may be combined with an 
undistorted model of the character 1612 to produce a distorted geometry 1614. The 
application of distortion lattices to three-dimensional models is well known in the art. 

As shown in Fig. 17, text 1704 may be laid out perpendicular to and between two 
paths 1700 and 1702. The distance 1706 between the two paths may determine the size, i.e., 

20 the width, of each character of the text. Letters along the edges of the text, e.g., letter 1 708, 
may be distorted according to a distortion lattice. The text may be laid out linearly if the text 
height is not position dependent. If the text height is position dependent, the text may be laid 
out in the maimer described above in connection with Fig. 10. 

Referring to Fig. 18, two non-parallel paths 1800 and 1802 also may be used to define 

25 scaling and rotation if the text is laid out along the path rather than perpendicular to the path. 
In this embodiment, scaling properties are determined by the angle of a line defining the 
shortest distance between a point on the bottom path and a point on the top half. In this 
embodiment, the width of a character is not position dependent because the height of the 
character is specified by the distance between the two paths. In this embodiment, the layout 

30 process described above in connection with Fig. 10 may be used with the following 

modifications. After an initial a position for a character is determined, the character's height 
at that position is determined fi:om which its width may be determined. The position may be 
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adjusted such that the spacing between characters i;emains accurate. Alternatively, the aspect 
ratio of the character may be adjusted as part of defining a distortion lattice for the character 
at that position. 

It should be understood that there are many ways to specify one or more paths and 
how text may appear along one or more paths. The foregoing illustrations are merely 
examples of the kinds of effects that may be made by using time and position dependent 
properties of text characters. 

Having now described a few embodiments, it should be apparent to those skilled in 
the art that the foregoing is merely illustrative and not limiting, having been presented by way 
of example only. Numerous modifications and other embodiments are within the scope of 
one of ordinary skill in the art and are contemplated as falling within the scope of the 
invention. 
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CLAIMS 

1 . A method for producing three-dimensional animation of objects along a path, 
comprising the steps of: 

receiving an input defining an outline of the object; 
5 associating properties with the object and the path defining three-dimensional 

attributes of the object along the path; 

generating a set of polygons defining a surface of the object using the received outline 
and properties; and 

rendering the set of polygons according to the properties along the path. 

10 

1, The method of claim 1 , wherein the step of generating comprises the steps of: 

converting curves defining the outline into a set of connected line segments; and 
tessellating the set of connected line segments to obtain the set of polygons. 

15 3. The method of claim 2, wherein the object is an alphanumeric character. 

4. The method of claim 1, wherein the object is an alphanumeric character. 

5. A computer system for producing three-dimensional animation of objects along a 
20 path, comprising: 

means for receiving an input defining an outline of the object; 

means for associating properties with the object and a path defining three-dimensional 
attributes of the object along the path; 

means for generating a set of polygons defining a surface of the object using the 
25 received outline and properties; and 

means for rendering the set of polygons according to the properties on a display along 
the path. 

6. The computer system of claim 5, wherein the means for generating comprises: 
30 means for converting curves defining the outline into a set of connected line 

segments; 

means for tessellating the set of connected line segments to obtain the set of polygons. 
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7. The computer system of claim 6, wherein the object is an alphanumeric character. 

8. The computer system of claim 5, wherein the object is an alphanumeric character. 

9. The computer system of claim 5, further comprising: 
means for editing the path. 

10. The computer system of claim 5, further comprising: 

means for defining a second path, wherein the object is placed between the path and 
the second path and has a size defined by a distance between the path and the second path. 

1 1 . The computer system of claim 10, wherein the path and the second path are parallel. 
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PROC VisitNode (Node n) 
Property p; 

FOR EACH p in n's list DO 

Push p onto its corresponding property stack ' 

END DO 

82 

ProcessThisNode (layout or draw) ^ 

84 

FOR EACH child (c) o n DO — ^ 

86 

VisitNode (c);^ — ' 

END DO 88 
FOR EACH p in n's property list DO 90 
Pop p's corresponding property stack^ ' 

END DO 
END PROC 
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